Adapting Self-Organizing Maps to the MapReduce Programming Paradigm

نویسنده

  • Christian Weichel
چکیده

We present an adaption of the self organizing map (SOM) useful for cluster analysis of large quantities of data such as music classification or customer behavior analysis. The algorithm is based on the batch SOM formulation which has been successfully adopted to other parallel architectures and perfectly suits the map reduce programming paradigm, thus enabling the use of large cloud computing infrastructures such as Amazon EC2.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A GPU-accelerated algorithm for self-organizing maps in a distributed environment

In this paper we introduce a MapReduce-based implementation of self-organizing maps that performs compute-bound operations on distributed GPUs. The kernels are optimized to ensure coalesced memory access and effective use of shared memory. We have performed extensive tests of our algorithms on a cluster of eight nodes with two NVidia Tesla M2050 attached to each, and we achieve a 10x speedup fo...

متن کامل

An ANALYSIS on VARIATIONS of INPUT PATTERN DISTRIBUTIONS in SELF-ORGANIZING MAPS in 2D

Self-organizing mapping is an unsupervised learning paradigm used in pattern classification and hence artificial intelligence. This paradigm is based on modifying the class features via the incoming input stimuli. Its exciting part is that it introduces concepts such as neighborhood or mapping. Hence the results obtained from this paradigm highly depend on the selected neighborhood and mapping ...

متن کامل

Visual mining in music collections with Emergent SOM

We describe different ways of organizing large collections of music with databionic mining techniques. The Emergent Self-Organizing Map is used to cluster and visualize similar artists and songs. The first method is the MusicMiner system that utilizes semantic descriptions learned from low level audio features for each song. The second method uses tags that have been assigned to songs and artis...

متن کامل

Green Product Consumers Segmentation Using Self-Organizing Maps in Iran

This study aims to segment the market based on demographical, psychological, and behavioral variables, and seeks to investigate their relationship with green consumer behavior. In this research, self-organizing maps are used to segment and to determine the features of green consumer behavior. This was a survey type of research study in which eight variables were selected from the demographical,...

متن کامل

Classification of Streaming Fuzzy DEA Using Self-Organizing Map

The classification of fuzzy data is considered as the most challenging areas of data analysis and the complexity of the procedures has been obstacle to the development of new methods for fuzzy data analysis. However, there are significant advances in modeling systems in which fuzzy data are available in the field of mathematical programming. In order to exploit the results of the researches on ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010